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Abstract 



In the 1960's Gisbert Hasenjaeger built Turing Machines from electrome- 
chanical relays and uniselectors. Recently, Glaschick reverse engineered the 
program of one of these machines and found that it is a universal Turing 
machine. In fact, its program uses only four states and two symbols, making 
it a very small universal Turing machine. (The machine has three tapes and 
a number of other features that are important to keep in mind when compar- 
I ' ing it to other small universal machines.) Hasenjaeger's machine simulates 

^»«: Hao Wang's B machines, which were proved universal by Wang. Unfor- 

A-^ tunately, Wang's original simulation algorithm suffers from an exponential 

• slowdown when simulating Turing machines. Hence, via this simulation, 

\l Hasenjaeger's machine also has an exponential slowdown when simulating 

Turing machines. In this work, we give a new efficient simulation algorithm 

for Wang's B machines by showing that they simulate Turing machines 

. . with only a polynomial slowdown. As a second result, we find that Hasen- 

^ jaeger's machine also efficiently simulates Turing machines in polynomial 

k> time. Thus, Hasenjaeger's machine is both small and fast. In another appli- 

^"jN cation of our result, we show that Hooper's small universal Turing machine 

^ simulates Turing machines in polynomial time, an exponential improvement. 
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Non-erasing read/write head — ' >*^ W: Non-erasable work tape 

Figure 1: Hasenjaeger's universal Turing machine 

1 Introduction 

In the 1960's Gisbert Hasenjaeger built Turing Machines from electromechanical 
relays and uniselectors, but never published details of these machines. Recently, 
Hasenjaeger's family donated the machine shown in Figure 1 to the Heinz Nix- 
dorf MuseumsForum* . At the request of the MuseumsForum, Glaschick reverse 
engineered the table of behaviour for this machine [1, 2], and, using Hasenjaeger's 
notes [4], determined the machine's encoding and operation. It was found that 
Hasenjaeger's machine simulates Wang's B machines [12]. 

Wang used a unary encoding when proving his B machines universal and hence 
they suffer from an exponential slowdown when simulating Turing machines. As a 
result, Hasenjaeger's machine also suffers from an exponential slowdown. In this 
work, we show that Wang B machines and Hasenjaeger's machine simulate Turing 
machines with polynomial slowdown via the following chain of simulations: 

Turing Machine i— >■ non-erasing Turing Machine t-^ 
Wang B machine i— >■ Hasenjaeger's universal Turing Machine 

where A ^^ B denotes that A is simulated by B. With the exception of the 
Wang B machine simulation of non-erasing machines, all of the simulations in 
the above chain are known to be efficient: non-erasing Turing machines simulate 
Turing machines with a polynomial slowdown in time [13], and Hasenjaeger's 
machine simulates Wang B machines in linear time. We complete the chain of 
efficient simulations by giving a new simulation algorithm that shows that Wang's 
B machines simulate Turing machines with only a polynomial slowdown in the 
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simulated Turing niachinc's time. An immediate consequence of our new algorithm 
is that Hasenjaeger's machine also simulates Turing machines in polynomial time. 
As mentioned above, the simulation of Turing machines by non-erasing Turing 
machines is already known to run with a polynomial slowdown [13]. However, 
to keep our paper self-contained, we give our own polynomial time simulation in 
Section 2. This is followed by our main result in Section 3, where we show that 
Wang B machines simulate non-erasing Turing machines in linear time. So from 
Sections 2 and 3 we get Theorem 1. 

Theorem 1. Let M he a deterministic Turing machine with a single binary tape 
that runs in time t. Then there is a Wang B machine Wm that simulates the 
computation of A4 in time 0{t^). 

In Section 4 we apply Theorem 1 to show that a small universal Turing ma- 
chine of Hooper's [5, 6] is efficiently universal by showing that it simulates Turing 
machines (via Wang B machines) in polynomial time. In Section 5 we give a for- 
mal description of Hasenjaeger's Turing machine and for the sake of completeness 
we show that Hasenjaeger's machine simulates Wang B machines in linear time. 
So from Section 5 and Theorem 1 we get Theorem 2. 

Theorem 2. Let M he a deterministic Turing machine with a single binary tape 
that computes in time t. Hasenjaeger's universal Turing machine simulates the 
computation of A4 in time 0{t^). 

For the remainder of this section we discuss program-size in small universal 
Turing machines. Hasenjaeger's machine has 4 states and 2 symbols, making it 
a remarkably small universal program. However, it uses 3 non-erasable tapes, 
and so making direct comparisons with other Turing machine models that have 
small universal programs (but have more or less tapes, tape dimensions, etc.) is 
not a straightforward matter. The standard model in the small universal Turing 
machine world consists of a single one dimensional tape with one tape head, a 
deterministic program, and the usual notion of a blank symbol [9]. Other more 
general models use larger numbers of tapes, higher tape dimensions, infinitely 
repeated blank words instead of a repeated blank symbol, and so on, and these 
more general models often have smaller universal programs. In the absence of 
formal tools, namely tight program-size overheads for simulations between these 
models, comparisons between them is at best challenging. Glaschick is the most 
recent author to propose a formula to compare such models [3] . 

As an example of the difficulty of comparing different Turing machine models, 
we consider a universal machine by Priese [11]. Priese's universal machine has 2- 
state, 2-symbol, and a single 2 dimensional tape with 2 tape heads. For standard 
2-state, 2-symbol, machines it is known that no universal machines exist as their 
halting problem is decidable [7, 10]. So, by generafising aspects of the model, 
Priese found a universal machine that has provably less states and symbols than 
the standard model. Returning our attention to Hasenjaeger's model, we note 
that while his machine has 3 tapes, the size of his program is still impressive when 
one considers that 2 tapes are read-only and the work tape is non-erasing. 

For more on small universal Turing machines see the surveys [8, 9]. 



2 Non-erasing Turing machines simulate Turing 
machines in time 0{t^) 

Definition 3 (Binary Turing machine) . A binary Turing machine is a tuple M ~ 
(Q, {0, 1}, /, goj 'ZlQl-i)- Here Q and {0,1} are the finite sets of states and tape 
symbols respectively. is the blank symbol, Qq Cz Q is the start state, and <Z|q|-i G 
Q is the halt state. The transition function f is of the form f : Q x {0, 1} -^ 
{0, 1} X {L, R} X Q and is undefined on {q\Q\-i} x {0, 1}. 

We write / as a list of transition rules. Each transition rule is a quintuple 
{qi,xi,X2, D,qj) with initial state qi G Q, read symbol xi G {0, 1}, write symbol 
X2 <£ {0, 1}, move direction D € {L,R}, and next state qj E Q. 

Definition 4 (Non-erasing Turing machine). A non-erasing Turing machine is a 
binary Turing machine where there are no transition rules that overwrite 1 with 
0, that is, there is no transition rule of the form {qj, 1, 0, D, q^), where qj, qk ^ Q 
and D G {L,R}. 

Lemma 5. Let M. be a deterministic single-tape binary Turing machine that 
runs in time t. Then there is a deterministic non- erasing Turing machine Mm 
that simulates the computation of Jv[ in time 0{t^). 

Proof. We give a brief overview of how M is simulated by a deterministic non- 
erasing Turing machine Mm with a single tape in time 0{t^). An arbitrary tape 
of Mi is encoded for Mm as follows. Each symbol on the tape of Mi is encoded 
as three contiguous symbols on the tape of Mm ■ The two rightmost symbols of 
each triple encode and 1 as 10 and 01 respectively. The leftmost symbol of the 
triple is 1 if and only if Mm is simulating that Mi 's tape head is currently reading 
the symbol encoded by the pair immediately to its right. To simulate a timestep 
of Mi , Mm simply makes a new copy of the encoded tape of Mi (to the right of 
the original), by scanning over and back repeatedly. During the copying process 
the encoded tape contents are appropriately modified to simulate the transition 
rule of Mi. This involves simulating the tape head movement of Mi by copying 
the 1 that encodes the tape head position of M to the left of the pair of symbols 
encoding the new read symbol. If we are simulating a rule where M changes a bit 
under its tape head, then the encoded read symbol (i.e. the triple) is appropriately 
changed by Mm ^s it is being copied. The state-changes of Mi can be simulated 
by state-changes of Mm iii ^ straight-forward manner. 

Since M runs in time t, it uses at most t tape cells. Thus, Mm takes 0{t^) 
steps when copying the encoding of an arbitrary configuration of M to simulate 
a single step of A^. So t steps of M are simulated by Mm in time 0{i^). D 

3 Wang B machines 

A Wang B machine is a computing machine with a single non-erasing bi-infinite 
tape that has a binary alphabet [12]. Unlike a Turing machine, which can perform 
three operations in a single timestep (write a 1 to its tape, move its tape head, and 



move program control to a arbitrary location in its program), a Wang B machine 
can perform only one operation at each timestep. Also, in a Turing machine, 
control flow can jump to an arbitrary program location when reading a or a 1, 
but a Wang B machine performs a control flow jump only when reading a 1. 

Definition 6 (Wang B machine). A Wang B machine is a finite list of instruc- 
tions W = Iq, Ii, I2, ■ ■ ■ , In-i where each instruction is of one of the following 
four forms: 

L : move tape head left, 
R : move tape head right, 

M : mark the current tape cell by writing the symbol 1, 
J{x) : if the current cell contains the symbol 1 then jump to instruction I^, 
otherwise move to the next instruction 

Instructions are executed by the machine one at a time, with each computation 
starting at instruction Iq. A left move or right move instruction (/^ € {L,R}) 
moves the head one cell to the left or right on the tape. A mark instruction 
(Jfe = M) marks the tape: if a cell is (unmarked) it becomes 1 (marked), 
otherwise if a cell is 1 it stays as 1. For a jump instruction, Ik = J{x), where 
0<a;<n— 1, if the current tape cell is 1 then the machine jumps to instruction 
Ix. Alternatively, when Ik = J{x) and the current cell is the machine will 
either move to the next instruction Ik+i if fc < n — 1, or it will halt if fc = n — 1. 
After each move or mark instruction 7^, the machine either moves to the next 
instruction Ik+i if fc < n — 1, or halts if fc = n — 1. 

3.1 Wang's B machines simulate non-erasing Turing ma- 
chines in linear time 

Tiieorem 7. Let f\f be a deterministic non-erasing Turing machine with a single 
binary tape that runs in time t. Then there is a Wang B machine WV that 
simulates the computation of Af in time 0{t). 

Proof. We begin by giving the program for the Wang B machine WV followed 
by the encoding it uses to simulate Af. We then show that WV simulates each 
transition rule in Af in constant time, and so simulates the computation of Af in 
time 0{t). 

3.1.1 Encoding 

Let (TRq.,^-^) denote a sequence of Wang B machine instructions that encode the 
transition rule TRq^„-^ — [qi, (Ti, (72, -D, qj) from A/" where g^, qj G Q, ci, (T2 G {0, 1} 



and D e {R, L}. The sequence of instructions for WV is 

WA/- = i?,J(8),(Ti?,„,o>,(ri?9o,i), 

i?,j(2i),(ri?,„o),(ri?,„i), 



i?,J(13z + 8),(ri?,,,o),(™,„i), (1) 



i?, J(i3(|g| - 2) + 8), (ri?„^,_,,o)(ri?„^,_„i), ^ 

where |(5| is the number of states in A/", and (Ti?^. q) and (TRq.i) are the instruc- 
tion sequences given by Equations (2) and (3). 

We now define Equations (2) and (3) which give the sequence of instructions 
used to simulate each transition rule. 



(ri?g„o) = < 



R,M,M,M,M,J{13j) if TRg^^o ^im,0,0, R,qj) 

L,L,L,M,M,J{13j} if ri?,^,o = (<Z»,0,0,L,q,) 

M,R,M,M,M,J{13j) if TRq^^o = (qr, 0,1, R,qj) 

M,L,L,L,M,J{13j) if ri?,^,o-(%,0,l,L,<7,) 



(2) 



,^P , \R,M,M,M,J{l3j) if ri?,„i = fe,l,l,i?,<z,) 

[i,L,L,M,J(13.7) if ri?,„i = fe,l,l,L,g,) 

(There are only two cases for {TRq._i) as non-erasing machines never overwrite a 
1 with a 0.) 

We encode the symbols and 1 of A/" as (0) = 10 and (1) = 11 respectively. 
An arbitrary configuration of M is given by 

Qi, WqWi ... Wj^iWj_Wj + i . . . Wn-1 (4) 

where Qi is the current state, wq . . . w„_i is the tape contents, Wk G {0, 1} and the 
tape head position is given by an underline. The configuration in Equation (4) is 
encoded as the B machine tape 

Ii3i, (wo) (wi) . . . (wj-i) WjjWj^ (wj+i) . . . (w„-i) (5) 

where (wk) G {(0),(1)}, the encoded read symbol Wj^Wj^ — (wj) is given in 
bold, and Iisi is the next instruction to be executed and encodes that Af is in 
state Qi. Note that Ii^i is the first instruction in the sequence /i3i, . . . , /i3i+i2 = 
R, J{13i + 8), {TRq.fi), (TRq.^i) that encodes the pair of transition rules for state 

Qi)- 

The infinite number of blank tape cells oi Af each contain the symbol 0, as do 
the blank tape cells of WV- Note that, during the simulation, WV may need to 
simulate the situation where the tape head of Af moves to a blank tape cell. In 
this case, as described below, the simulator WV will move to the relevant blank 
portion of its own tape and convert the symbol pair 00 to 10 = (0). 



3.1.2 Simulating transition rules 

At the start of each simulated timestep of machine A/", our Wang machine W/v has 
a configuration of the form given in Equation (5). Each simulated timestep begins 
with VW choosing which transition rule to simulate by reading the encoded read 
symbol and then choosing which sequence {{TRg^fi) or {TRq.^i)) to execute. 

From Equation (5), each simulated timestep begins with the tape head over 
the leftmost symbol of the encoded read symbol ((0) = 10 or (1) = 11). So, 
immediately after we execute the first instruction (which is I\^i = R) the tape 
head is over the rightmost symbol of (0) or (1) and the program control is at 
instruction Im+i = J(13z + 8). If we are reading (0) = 10, then the rightmost 
symbol is a and so no jump occurs on J(13z + 8). This means that control moves 
to instruction /i3i+2j the leftmost instruction in {TRq.fi). Alternatively, if we are 
reading (1) = 11, then the rightmost symbol is a 1 and J(13i + 8) will jump to 
instruction /isi+si sending control to the leftmost instruction of (TRq.^i). (To see 
this, use Equation (1) to count the number of instructions that precede {TRq.,i), 
which gives 13i + 8, specifically 13 instructions for each state qj where j < i and 
a further 8 instructions for the sequence R, J(13z + 8), (TRq^fi).) 

We now explain how the sequences in Equations (2) and (3) simulate the 
transition rules of Af. 

Case 1: read symbol of A/" is 1. As described at the beginning of Section 3.1.2, 
the simulation of each timestep begins with the execution of R, J(13i + 8). When 
the read symbol oi Af is 1, and the pair of instructions R, J(13i + 8) have executed, 
we have the following tape contents for WV 

(wq) (wi) . . . {wj-2) 10 11 (wj+i) . . . (w„-i) (6) 

(For illustration purposes, we assume that in Af the symbol to the left of the read 
symbol is a 0, which is encoded as (0) = 10 in Equation (6).) 

As described at the beginning of Section 3.1.2, when the read symbol oi Af 
is 1 the execution of R, J(13i + 8) is followed by the execution of the sequence 
(TRq.^i). If we are simulating (g^, 1, 1, L, qj), then from Equation (3) the instruc- 
tion sequence {TRq.^i) = L, L, L, M, J(13j) is applied to the tape in Equation (6) 
to give 

(wq) (wi) . . . {Wj^2) 10 11 (Wj + l) . . . (w„-i) (7) 

The tape in Equation (7) is of the form in Equation (5), hence the tape configura- 
tion is ready for the simulation of the next timestep. The jump instruction J(13j) 
sent the program control of HV to the first instruction of the sequence instruc- 
tions that encodes state qj . This is verified by counting the number of instructions 
to the left of R, J(13j -|- 8), (TRq.^), {TRq.^i) using the same technique as above. 
So, the simulation of the transition rule (g,, 1, 1, L, qj) is complete. 

To generalise this example to all possible cases for simulating a rule of the form 
(gi, 1, 1, L, qj) we need only consider the encoded symbol (from Af) immediately 
to the left of the encoded read symbol (in our analysis i,j are already arbitrary). 
If the encoded symbol to the left of the tape head in Equation (6) was (1) = 11 
instead of (0) = 10, then it is verified in the same straightforward manner. If we 
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are simulating the situation where M is at the left end of its tape (the tape is 
blank to the left: all Os) and so contains the pair 00 immediately to the left of 
the encoded read symbol in Equation (6). This 00 pair is changed to (0) = 10 
by the M instruction that immediately proceeds the J(13j) instruction, correctly 
providing the symbol pair 10 that encodes the as (0) = 10 for the next simulated 
timestep. Also, the 1 printed by this M instruction allows instruction J(13.j) to 
jump the program control to the encoding of the next state qj. 

The case of simulating (g^, 1, 1, _R, gj) is verified by applying the sequence 
(TRg.^i) = R,M,M,M,J{15j) from Equation (3) to the tape in Equation (6). 
This analysis is similar to the previous example and so we omit the details. We 
simply note that after the first of the three M instructions is executed, the tape 
cell will always contain a 1 and so the second and third M instructions do not 
change the tape. (These extra M instructions are used for padding so that each 
encoded state has exactly 13 instructions.) 

Case 2: read symbol of A/" is 0. As described at the beginning of Section 3.1.2, 
the simulation of each timestep begins with the execution of R, J{13i + 8). When 
the read symbol of Af is 0, and the pair of instructions R, J(13i + 8) have been 
executed, we have the following tape contents for WV- 

(wo) (Wl) . . . (Wj-1> 10 11 (Wj+2) ■ ■ ■ (Wn-l) (8) 

(For illustration purposes, we assume that in M the symbol to the right of the 
read symbol is a 1, which is encoded as (1) = 11 as in Equation (8).) 

As described at the beginning of Section 3.1.2, when the read symbol of A/" 
is the execution of R, J(13i + 8) is followed by the execution of the sequence 
(TRq.fi). If we are simulating (g^, 0, 1, R, qj), then from Equation (2) the sequence 
{TRg.^o) = M, R, M, M, M, J(13j) is applied to the tape in Equation (8) to give 

(wo) (wi) . . . (Wj-l) 11 11 {Wj+2) ■ ■ ■ (Wn-l) (9) 

The first M instruction changed (0) — 10 to (1) — 11 simulating the printing 
of the write symbol by Af. The tape in Equation (9) is of the form found in 
Equation (5) and is ready for the simulation of the next transition rule to begin. 
The jump instruction J(13j) sends the program control of VVV to the instruction 
sequence of the program that encodes state Qj. This is verified using the same 
technique as in the previous case. So, the simulation of (q^, 0, 1, i?, Qj) is complete. 
To generalise this example to all possible cases for simulating a rule of the form 
{qi, 0, 1, i?, qj), we need only consider the encoded symbol (from Af) immediately 
to the right of the encoded read symbol (in our analysis i, j are already arbitrary). 
If the encoded symbol to the right of the tape head in Equation (8) was (1) = 10 
instead of (1) = 11, then it is verified in the same straightforward manner. If we 
are simulating the situation where Af is at the right end of its tape (the tape is 
blank to the right: all Os) and so contains the pair 00 immediately to the right of 
the encoded read symbol in Equation (8). This 00 pair is changed to (0) = 10 by 
the second M in the sequence M, R, M, M, M, J(13j) which provides the symbol 
pair 10 — (0) that correctly encodes a for the next simulated timestep. Also, the 



1 printed by this M instruction allows instruction J(13j) to jump the program 
control to the encoding of the next state qj . As with the previous case, the extra M 
instructions are added for padding. 

The other cases for simulating M reading a are verified by applying the 
appropriate sequences from Equation (2) to the tape in Equation (8). The details 
are similar to the previous example and are omitted. 

3.1.3 Halting and time complexity. 

When M enters its halt state, defined to be state q\Q\-i in Definition 3, then 
Waa executes the jump instruction J(13(|Q| — 1)) and jumps to the rightmost 
instruction in Equation (1), an M instruction. Note that in order to jump to 
this M instruction we must have read a 1 on the tape, and so this M does not 
change the tape. After executing this M instruction, WV is at the end of its list 
of instructions and so it halts. 

From Equations (1), (2) and (3), exactly 13 instructions are used to encode 
the pair of transition rules for each state g^ of A/". Furthermore, from the above 
algorithm, the simulation of one of these transition rules involves the execution 
of at most 8 instructions (at most 8 timesteps) . Thus WV simulates t steps of an 
arbitrary non-erasing Turing machine M in time 0{t). D 

4 Hooper's small universal Turing machine sim- 
ulates Turing machines in polynomial time 

Hooper [5, 6] gave a small universal Turing machine with 1 state, 2 symbols and 
4 tapes. Using similar techniques to Hasenjaeger, Hooper proved his machine 
universal by simulating a restricted class of Wang B machines. In Hooper's ma- 
chine, a non-erasing work tape contains exactly the same contents as the tape 
of the Wang B machine it simulates, and a read-only unidirectional circular pro- 
gram tape stores the encoded Wang machine program. Hooper used a relative 
addressing technique like Hasenjaeger, but unlike Hasenjaeger, Hooper used two 
read- write counter tapes (instead of one read-only tape). Hooper's simulation of 
Wang B machines runs in linear time, and so from Lemma 5 and by suitably 
modifying the proof of Theorem 7 we get the following result. 

Theorem 8. Let Ai he a deterministic Turing machine with a single binary tape 
that runs in time t. Then Hooper's small universal Turing machine with 1 state, 

2 symbols, and 4 tapes [5, 6] simulates the computation of A4 in time 0{t^). 

Proof. Hooper's machine simulates Wang B machines with the following restric- 
tions: 

1. In the program list if I^. — J{x), then Ik+i G {L, R\. 

2. Each jump instruction jumps to {L, i?}. 

3. M instructions are executed only on tape cells that contain 0. 



Our proof of Theorem 7 is easily modified to include the above restrictions. For 
restriction 1, we add the instruction sequence L, R after each jump instruction in 
the program. This has no effect on the program as a move left followed by a move 
right has the same effect as no move. Our proof already satisfies restriction 2, 
as we either jump to the beginning of the sequence encoding a state qi (that is: 
R, J(13i + 8), {TRq.fi), {TRq.^i}) or we jump to the beginning of a sequence of 
the form {TRq.,i) (given in Equation (3)). To satisiy restriction 3, each Ik = M 
instruction is replaced with the sequence J(fc + 4), i?, L, M, R, L. The J(fe + 4) 
will jump over the M instruction if the cell already contains a 1, and the extra 
R, L instructions are introduced to satisfy restrictions 1 and 2. 

In addition to the above changes, we wish to maintain the property from the 
proof of Theorem 7 that the number of instructions used to encode each Turing 
machine state is the same for all states. Recall from Theorem 7 that each state 
qi is encoded as the sequence of 13 instructions R, J{13i + 8), {TRq.fi), {TRq.^i). 
This sequence has 3 jump instructions and to satisfy restriction 1 we added the 
extra instruction pair L, R for each jump. For restriction 3, we replaced each M 
instruction with J{k + 4), i?, L, M, R, L. In Equation (3) this gives an extra 15 in- 
structions for the case {qi, 1, 1, R, qj) and an extra 5 for the case {qi, 1, 1, L, qj). To 
ensure that the instruction sequence is the same length for each case we append the 
lcngth-10 sequence {L, R, )^ to the sequence for case {qi, 1, 1, R, qj). Satisfying re- 
striction 3 in Equation (2) gives an extra 20 instructions for the cases {qi, 0, 0, R, qj) 
and {qi, 0, 1, R, qj), and an extra 10 for cases {qi,0, 0, L, qj) and qi, 0, 1, L, qj. To 
ensure that the instruction sequence is the same length for each case we append 
the length-10 sequence {L,R,)^ to the sequences for case {qi,0,0,L,qj) and case 
{qi, 0,1, L, qj). Now the length of the sequence that encodes each state is 54 (in- 
stead of 13), and so we replace jumps of the from J(13i) with jumps of the from 
J(54z). The sequence R, J{13i + 8), {TRq.fi) of length 8 has been replaced by a 
sequence of length 32, and so we replace jumps of the from J(13J + 8) with jumps 
of the form J(54i-|-32). This completes our conversion to a Wang B machine with 
the 3 restrictions mentioned above. D 

5 Hasenjaeger's electromechanical universal Tur- 
ing machine 

We begin this section by briefly describing the electromechanical device con- 
structed by Hasenjaeger [4], which implements a multi-tape Turing machine. As 
mentioned in Section 1, Glaschick [3] reverse engineered the physical wiring of 
Hasenjaeger's electromechanical machine to find the Turing machine program left 
by the previous programmer, presumably Hasenjaeger, and with the help of Hasen- 
jaeger's notes saw that it simulates Wang B machines. For completeness we include 
a proof that this program (wiring) for Hasenjaeger's machine simulates Wang B 
machines in linear time.^ 

First we briefly describe Hasenjaeger's electromechanical machine, which is 
shown in Figure 1. 

tNote that the machine can be rc-progranimcd by re- wiring. 
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The control unit is constructed from 16 electromechanical relays which en- 
code the main program (also called the state table) of the Hasenjaeger ma- 
chine. This unit is limited to 4 states and operates on three tapes. 

The program tape (P) is constructed using 20 switches, 18 of which are 
connected, representing a cyclic, bi-directional, read-only binary tape with 
18 cells. (This tape can be used to store a simulated program.) 

The counter tape (C) consists of two selector switches that represent a bi- 
directional, cyclic, read-only tape with 18 cells. It represents a tape where 
all cells contain a 1 except for a single cell that contains a 0. 



The work tape (W) is a bi-directional non-erasable "infinite" tape 



t 



Hasenjaeger's electromechanical device, as wired, is an instance of a Turing 
machine. However, exactly what kind of Turing machine is a matter of opinion: 
there are a number of reasonable generalizations of this single device (machine 
instance) to get a general model of computation, here we give one. Formally, we 
write the tuple {Q,f,qs) to denote an instance of a three-tape Turing machine 
of the following form. The three tapes are bi-directional and are denoted P, C 
and W. Each tape has alphabet {0, 1} and blank symbol 0. Tapes P and C are 
read-only, while W is non-erasing (i.e. Is can not be overwritten with Os). To 
give an instance of such a machine, we would assign values to the tuple (Q, /, Qs), 
where Q is a, set of states, / is a transition function (or transition table), of the 
form / : g X {0, 1} X {0, 1} x {0, 1} -^ {L, R, _} x {L, R, _} x {L, R, _, 1} x Q, and 
<7s G Q is the start state. 

The machine works as follows. In state q G Q, the machine reads a symbol 
from each of the tapes P, C, and W and, as dictated by /, for each tape does one 
of three things: move left (L), move right (i?), do nothing (_). However, for the 
tape W it has an additional fourth option of marking (M) the tape cell with the 
symbol 1. 

Now we formally specify Hasenjaeger's machine "H = {Q, /, Qs) as an instance 
of the above model. "H has four states Q — {qi,q2,Q3,Q4:} and the start state is 
Qs = qi- The function / is given as a list of transition rules in Table 1. This table of 
behaviour is derived from the wiring of the electromagnetic relays of Hasenjaeger's 
device. 

Lemma 9. Let W be a Wang B machine that runs in time t. The multitape 
Turing machine %, defined above, simulates the computation ofW in time 0{t). 

Proof. We begin by giving the encoding used by the program W, followed by 
a description of how the program simulates each of the four Wang B machine 
instructions as well as halting. We finish by giving the time analysis for this 
simulation. 



■fit is expected that the recent precipitous decline in the production of 35min film and paper 
punch tape will negatively impact the computing power of Hasenjaeger's machine. 
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Table 1: The program / for Hasenjaeger's universal machine "H that simulates 
Wang's B machines. The * symbol denotes that the read symbol can be or 1. 
The rule numbers on the left are not part of the program. 



Encoding. The four Wang B machine instructions M, R, L and J{x) are 
encoded as binary words as follows: (M) — 1, (i?) = 01, (L) = 001, and 
(J(x)) = 0000«1 (the value y € {0,1,2,...} wih be defined later). The Wang 
B machine program W = Io,Ii,. . . ,/„_i is encoded as a single binary word via 
Equation (10). 



(W) = (/o)(/l)(/2) . . . {In-2){In-l){J{n)) 



(10) 



The word (W) e {0, 1}* is placed on H's circular program tape P. The C tape 
is defined to have length n + 2, with n + 1 of these cells containing the symbol 1, 
and the single remaining cell containing the symbol 0. The W tape has the same 
tape contents as that of the Wang B machine it simulates. At the beginning of 
a simulated computation step the tape head of P is over the leftmost symbol of 
the encoded instruction it is simulating, C"s tape head is over its single symbol, 
and the tape head of W has the same location as the tape head of the Wang B 
machine it simulates. 

To help simplify our explanation, we give partial configurations for "H where 
we display a small part of each tape surrounding the tape head. For example, the 
following configuration occurs at the beginning of a simulated computation step 



91 



P = 



1001. 



c = 



101. 



M^ == . . . 100 . 
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Here, "H's current state is qi and the position of each of the three tape heads is 
given by an underhne. Also, in the above example the tape head of P is over the 
leftmost symbol of an encoded left move instruction (L) = 001, and the C tape 
head is at cell Cq. 

Simulate M instruction. The Wang B machine M instruction is encoded as 
(M) = 1 on the P tape. If the tape head of W is reading a then we have a 
configuration of the form 

qi P==...11001... C = ...101... T4^ = ...100... 

(For the purposes of explanation, wc have assumed that there is an encoded L 
instruction, given by (L) — 001, to the right of (M) = 1 on the P tape.) Rule 1 
from Table 1 is applied to the above configuration to give 

qi P = ...11001... C = ...101... T4^ = ...110... 

The M instruction was simulated by printing a 1 to the W tape. Note that the 
tape head on the P tape has moved to the leftmost symbol of the next encoded 
instruction ((i) ~ 001), and the current state of "H is once again qi. So the 
simulation of the M instruction is complete and "H is configured to begin simulation 
of the next Wang machine instruction. 

In the case where the tape head of W is reading a 1, we simulate the M 
instruction by executing rule 2 from Table 1. This is very similar to the previous 
case above and so we omit the detail. 

Simulate R instruction. The Wang B machine right move instruction is en- 
coded as (i?) = 01 on the P tape. If the tape head of W is reading a then we 
have a configuration of the form 

gi P = . . . 1 01 001 . . . C = . . . 101 . . . W ^ ...100... 

Rules 3 and 4 from Table 1 are applied to the above configuration to give 

gi P = ... 101 001... C==...101... W = ...lOO... 

The tape head of W was moved one place to the right to simulate the R instruction. 
Also, the tape head on the P tape has moved right 2 places to the leftmost symbol 
of the next encoded instruction ((P) = 001), and the current state of H is once 
again qi . So the simulation of the M instruction is complete and "H is configured 
to begin simulation of the next Wang machine instruction. In the case where the 
tape head of W is reading a 1, the computation proceeds in the same manner as 
above by executing rules 3 and 4 from Table 1. 

Simulate L instruction. The Wang B machine left move instruction is encoded 
as (L) — 001 on the P tape. If the tape head of W is reading a then we have a 
configuration of the form 

gi P = ... 1001 01... C=...101... W^...100... 
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Rules 3, 5 and 6 from Table 1 are applied to the above configuration to give 

gi P = ... 1001 01... C=...101... VK = ...100... 

The tape head of W was moved one place to the left to simulate the L instruction. 
Also, the tape head on the P tape has moved right 3 places to the leftmost symbol 
of the next encoded instruction ((i?) = 01), and the current state of H is once 
again qi . So the simulation of the M instruction is complete and "H is configured 
to begin simulation of the next Wang machine instruction. In the case where the 
tape head of W is reading a 1, the computation proceeds in the same manner as 
above by executing rules 3, 5 and 6 from Table 1. 

Simulate Ik = J{x) instruction. There are two cases to consider here, which 
are determined by the value of read symbol of the simulated Wang B machine. 

Case 1. Wang B machine's read symbol is 0. In this case, "H simulates program 
control for W moving from instruction Ik to instruction Ik+i- This is simulated by 
moving the tape head to the leftmost symbol of (Ik+i)- Instruction Ik = Jix) is 
encoded as {Ik) = {J{x)) ~ 0000^1 for some y € {0, 1,2,.. .} {y is defined below), 
and for the purposes of explanation we assume that Ik+i = L. This gives the 
configuration 

qi P = ...100002'1001... C=...01... W = ...Q... 

After applying rules 3, 5 and 7 from Table 1 we get the following 

93 P= ...lOOOOO^^HOOl... C = ...01... W = ...Q... 

Next, rule 8 is applied y times followed by a single application of rule 9 to give 

qi P = ...100002'1001... C=...01... W^...Q... 

In the configuration immediately above, the simulation of J{x) when the Wang 
machine read symbol is is complete. Note that % has returned to state qi 
and the tape head of P is over the leftmost symbol of the encoded instruction 
(/fc+i) = 001. 

Case 2. Wang B machine read symbol is 1. In this case, simulating the 
instruction Ik ~ J{x) involves moving the P tape head to the leftmost symbol of 

We begin with an overview, which includes specifying the encoding of jump 
instructions. Each encoded instruction contains a single 1 symbol, and so as we 
move through the P tape we can count the number of encoded instructions by 
counting the number of 1 symbols. If a; ^ A:, then, from Equation (10), we can 
move from {Ik) to {Ix) on P by moving left until we have read the symbol 1 
exactly fc — x + 1 times, and then moving right. Recall that the P tape is circular, 
and so if a; > fc, using Equation (10), we move from {Ik) to {Ix) on P by moving 
left until we have read the 1 symbol exactly (n + 1 + fc — x) times. We are now 
ready to give the encoding for jump instructions. 

(4) = ( j(x)) = oooon 
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where 

k — X if X ^ fc 

n + 1 + k — X if X > k 



(11) 



In the simulation, moving from (/j.) to (I^) is done in 2 stages. In the first 
stage the word {J{x)) — 0000^1 is read and the value y + l is recorded by the tape 
head position on the C tape. In the second stage, using the value stored on the C 
tape, the tape head of P moves left until we have read the symbol 1 exactly y + l 
times. So, the tape head of P finishes its scan left immediately to the left of the 
1 in (/x-i)j from there it moves right two cells to the leftmost symbol of (Ix)- 

Now we give the details of how "H simulates a jump from instruction I^ to 
instruction I^ ■ For the purposes of illustration we assume the instruction to the 
left of Ik is /fc-i = L. This gives the configuration 

qi p = ...ooioooon... C = ...OP+^.. W = ...!... 

First, rules 3, 5 and 7 from Table 1 are applied, and then rule 10 is applied y 
times, followed by a single application of rule 11, to give 

Qi ...p = 001 0000^^^01... c==...oin... t4^ = ...i... 

In the configuration immediately above the value y + 1 is recorded by the position 
of the tape head of C, which is over y + 1*^ symbol to the right of the single 
symbol. Rule 12 is applied y + 3 times to give 

94 ...p = 001 0000^1... c = ...oin... w^...i... 

When 1 is read on tape P the value stored on tape C is decremented by moving 
left once on C using rule 13. This gives 

Qi ...P^OOlOOOO^'l... C = ...Ol^^^ll... T4^ = ...l... 

The above process of decrementing the value stored in C by applying rules 12 and 
13 continues until the tape head of C reads a 0, indicating that the scan left is 
finished (during this process Rule 13 is applied a total of y + 1 times). At this 
point we have a configuration that is of one of the following two forms 



94 


P= . 


..01... 
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. . 01 . . . 
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■ ■I 


94 


P= . 


..11... 
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■ ■I 



Rule 13 was applied y + l times reading a 1 each time. Rules 14 and 2 are applied 
to move the tape head of P right twice, placing it over the leftmost symbol of 
instruction {Ix) to complete the simulation of Ik = J{x). 

Simulation of halting. Recall from Section 3, that a Wang B machine halts 
when it attempts to move to the non-existent instruction /„ after executing in- 
struction In-i- Since % does not have a distinguished halt state, it instead sim- 
ulates halting by entering a repeating sequence of configurations. Note that in 
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Equation (10), as part of the Wang B machine encoding, there is an extra instruc- 
tion (/„ = J(n)) that jumps to itself. So when program V. simulates a Wang B 
machine that halts by attempting to move to instruction /„, the program simu- 
lates the instruction J{n) which results in an infinite loop and signals the end of 
the simulation. (The jump instruction works as intended only if we have the as- 
sumption that the cell under W's tape head reads 1; it is easy to modify any Wang 
B program so that this is the case by having the program end with a single mark 
instruction, i.e. /„_i — M). This jump works as follows. From Equation (11), a 
jump instruction of the form /„ = J(n) is encoded as (/„) = {J{n)) = 0001. This 
gives the configuration 

qi P = ...0001... C = ...01... W = ...l... (12) 

From here, H simulates the jump instruction /„ = J{n), as described above. In 
this simple case, simulating the jump instruction involves executing exactly 10 
rules (see Table 1) after which H returns to configuration (12). Hence we get an 
infinite loop where the tape contents are unchanged. 

Complexity analysis. The Wang B machine instructions M, R, and L are 
each simulated by Hasenjaeger's machine in 1, 2 and 3 timesteps, respectively. 
The J(x) instruction is simulated in 0{n^) timesteps, where n is the number of 
instructions in the Wang B machine program. Note that we consider n to be a 
constant, independent of the input length. Therefore, Hasenjaeger's program "H 
simulates t steps of the Wang B machine W in time 0{t). D 
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